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ABSTRACT 

The  successful  evaluation  of  airborne  acoustic  pro- 
cessing systems  requires  the  use  of  well  planned  experi- 
mental designs.   A  good  design  will  enable  the  evaluator 
to  report  results  which  can  be  used  to  predict  system 
performance  in  a  wide  range  of  operating  environments. 
A  method  for  determining  detection  performance  from  the 
results  of  a  controlled  experiment  is  developed.   An 
example  of  the  procedure  for  a  hypothetical  system  is 
presented  with  a  suggested  method  for  comparing  forecast 
detection  performance  with  in-flight  test  results. 
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I.   INTRODUCTION 

The  antisubmarine  warfare  (ASW)  capability  of  the  air 
arms  of  western  navies  depends  upon  their  ability  to  employ 
air-dropable  sonobouys  for  the  detection  and  localization 
of  enemy  submarines.   The  systems  which  process  the  acous- 
tic information  from  these  sonobouys  are  sophisticated  spec- 
trum analyzers.   A  constant  effort  to  improve  performance 
has  led  to  frequent  updates  to  existing  systems  and  the 
periodic  development  of  completely  new  processors. 

Assessing  the  significance  of  these  improvements  in  the 
operational  environment  is  the  function  of  an  operational 
evaluation.   The  operational  evaluation  of  an  acoustic 
processing  system  is  complicated  by  the  impact  of  the  con- 
stantly changing  acoustic  propagation  conditions  encountered 
in  ocean  operating  areas.   This,  along  with  the  inherent 
variability  in  the  other  factors  affecting  passive  sonar 
performance,  makes  it  difficult  to  compare  the  ranges 
achieved  from  one  trial  to  the  next.   However,  it  is  impera- 
tive that  the  evaluation  agencies  report  findings  that  are 
applicable  over  the  wide  variety  of  operating  conditions 
which  are  likely  to  be  encountered  in  the  operational  employ- 
ment of  the  airborne  acoustic  processor. 

It  is  not  sufficient  to  report  detection  ranges  achieved 
during  evaluation  trials.  The  median  detection  range  (MDR) , 
which  is  a  common  method  for  reporting  expected  ranges  in 


operational  situations,  is  not  suitable  for  reporting  the 
detection  capability  of  an  acoustic  system.   The  value  of 
the  MDR  is  valid  for  only  one  combination  of  transmission 
loss,  target  source  level,  ambient  noise,  and  detection 
threshold.   It  is  necessary  to  report  detection  capability 
in  a  manner  which  can  readily  be  adapted  to  any  combination 
of  these  factors.   It  has  been  suggested  by  F .  A.  Andrews 
(Operational  Analysis  Study  Group,  1977)  that  the  figure  of 
merit,  which  is  the  sum  of  the  range  independent  terms  in 
the  passive  sonar  equation,  would  be  an  appropriate  method 
of  measuring  a  processing  system's  detection  capability. 
The  detection  threshold  is  the  only  portion  of  the  figure 
of  merit  which  does  not  change  when  the  target  and  the 
environmental  conditions  are  different.   This  thesis  develops 
a  model  for  accurately  determining  the  expected  detection 
threshold.   The  figure  of  merit  can  be  calculated  using 
this  value  of  detection  threshold  and  available  estimates 
of  source  level  and  ambient  noise  for  use  in  conjunction 
with  transmission  loss  predictions  to  predict  the  system's 
detection  capability  for  a  specific  encounter. 

Constraints  on  time,  target  resources  and  equipment 
make  it  necessary  to  perform  operational  evaluations  in  such 
a  way  as  to  gain  the  maximum  amount  of  reliable  information 
from  the  limited  number  of  encounters  available.   This 
involves  planning  the  evaluation  using  sound  experimenta- 
tion concepts.   Such  a  plan  must  be  cognizant  of  the  variety 


of  factors  which  are  pertinent  to  the  trials  at  hand.   Some 
of  these  experimentation  principles  and  a  number  of  the 
factors  encountered  in  the  operational  evaluation  of  air- 
borne acoustic  processing  systems  are  addressed. 
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II.   EXPERIMENTATION  CONCEPTS 

A.   LEVELS  AND  GOALS  OF  EXPERIMENTATION 

Developing  the  goals  for  an  evaluation  is  an  important 
step  in  the  planning  process.  The  specification  of  objec- 
tives to  be  achieved  will  help  ensure  that  experimentation 
resources  are  focused  upon  the  areas  of  greatest  interest. 
The  most  elaborate  of  test  sequences  will  be  of  limited 
usefulness  if  the  goals  of  the  evaluation  do  not  address 
the  correct  areas  of  interest. 

The  nature  of  the  goals  of  an  evaluation  determine  the 
level  of  experimentation  required.  For  air-dropable  sono- 
bouys  these  goals  might  normally  include: 

1.  Establish  the  capability  of  the  system  to  detect, 
classify,  and  localize  a  submarine. 

2.  Determine  geographic  areas  or  system  operating 
modes  for  which  system  is  degraded  and  establish 
alternate  procedures,  enabling  operators  to  work 
around  the  degraded  modes. 

3.  Exercise  proposed  operating  procedures  and  tactics 
and  suggest  improvements. 

The  level  of  experimentation  refers  to  the  degree  of 
formalism  involved.   For  some  types  of  goals  a  loosely 
structured  scenario  with  results  reported  in  a  subjective 
manner  will  suffice.   Meeting  other  goals  may  require  a 
highly  structured  test  environment  and  close  control  of  the 
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factors  involved.   Barr,  Poock,  and  Richards  (1978)  iden- 
tify four  bands  of  experimentation  formalism. 

1.  Validation  Experiments 

This  type  of  experiment  requires  the  lowest  order  of 
formalism.   The  goal  of  a  validation  experiment  might  be  to 
determine  the  feasibility  of  operating  a  system  within 
specifications.   This  level  of  experimentation  would  normally 
be  expected  in  developmental  evaluations. 

2 .  Demonstration  Experiments 

Goals  of  experiments  of  this  type  would  involve  the 
demonstration  of  a  system's  capabilities  within  a  loosely 
structured  scenario  or  in  the  performance  of  a  given  set  of 
activities.   The  results  of  such  a  demonstration  would  not 
normally  be  the  subject  of  a  formal  analysis. 

3 .  Assessment  Experiments 

The  objective  of  assessment  experiments  would  be  to 
gain  an  idea  of  how  well  a  system  works  over  a  wide  range  of 
conditions.   Little  control  over  the  sources  of  error  would 
be  expected  and  usually  the  data  from  the  experiment  would 
include  only  the  subjective  opinions  of  those  involved  in 
or  observing  the  trials. 

4 .  Evaluation  Experiments 

This  most  formally  structured  type  of  experiment 
requires  careful  recording  of  experimental  conditions.   The 
data  collected  would  normally  be  numerical,  and  in  a  format 
to  facilitate  formal  analysis. 
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For  an  acoustic  processing  system  the  areas  of 
investigation  regarding  operating  procedures  or  the  human 
factors  involved  in  the  man-machine  interface  may  best  be 
served  by  subjective  assessment.   Performance  in  these  areas 
is  difficult  to  quantify  or  measure  making  the  opinions  of 
those  involved  in  operating  the  equipment  or  in  exercising 
the  operating  procedures  the  most  useful  information  for 
these  areas  of  investigation.   A  questionnaire  could  be 
developed  to  insure  that  the  subjective  assessments  are 
collected  in  a  format  which  can  be  readily  summarized  and 
evaluated. 

Establishing  the  specific  capabilities  of  a  system 
will  require  a  more  rigorous  experimental  approach  involving 
a  number  of  replications  under  carefully  recorded  conditions 
In  this  case  the  trials  must  be  followed  by  a  formal  analy- 
sis of  results.   The  analyses  of  results  in  the  areas  of 
detection,  classification  and  localization  should  normally 
be  based  on  numerical  data  and  not  on  subjective  assessments 
The  more  formal  level  of  experimentation  will  enable  the 
evaluation  to  report  the  system's  capabilities  in  measurable 
terms  such  as  decibels  or  nautical  miles.   These  numerical 
results  will  allow  their  use  in  conjunction  with  available 
models  to  predict  system  performance  in  situations  not 
directly  assessed,  but  within  the  scope  of  the  experiment. 

B.   SCOPE  OF  EXPERIMENT  AND  CREDIBILITY  OF  RESULTS 

The  evaluation  of  a  system  which  will  be  employed  in  a 
broad  spectrum  of  environments  and  scenarios  requires  that 


13 


difficult  decisions  be  made  in  determining  its  scope.   In 
all  instances  constraints  on  resources  will  require  a  deli- 
cate balance  between  the  scope  of  the  experiment  and  the 
degree  of  confidence  achievable  in  specific  results.   The 
initial  draft  of  a  test  plan  should  set  out  objectives  for 
the  evaluation  which  include  covering  a  broad  range  of 
areas  of  investigation.   Subsequent  refinements  to  the  test 
plan  will  require  compromise  between  the  scope  of  the  objec- 
tives and  available  resources.   Where  possible  preliminary 
experiments  and  calculations  should  be  used  to  gain  insight 
into  those  variables  which  will  impact  on  the  results  achieved 
in  field  trials.   These  preliminary  studies  will  aid  the 
test  planners  in  deciding  what  range  of  scenarios  is  dic- 
tated and  where  resources  will  be  most  efficiently  expended. 
Resources  should  be  concentrated  in  areas  where  there  is  a 
high  probability  of  achieving  meaningful  results.   The 
scope  should  not  be  narrowed,  however,  to  a  point  where 
results  no  longer  have  a  useful  range  of  applicability.   For 
example  a  thorough  investigation  of  the  system's  performance 
in  a  specific  operating  area  will  not  provide  insight  into 
the  effects  of  operating  in  another  location  where  environ- 
mental conditions  are  substantially  different.   At  the  same 
time  an  attempt  to  encompass  too  many  different  areas  will 
lead  to  a  paucity  of  observations  for  determining  with  any 
credibility  the  performance  in  any  of  the  locations. 


14 


A  more  reasonable  approach  would  be  to  select  a  small 
number  of  operating  areas  which  span  the  environmental 
conditions  in  which  the  system  will  operate.   By  picking 
one  area  that  is  perceived  as  most  favorable  to  the  system 
and  one  or  two  others  that  are  thought  to  be  less  promising 
the  results  can  be  expected  to  show  to  what  degree  the 
varying  conditions  affect  the  capabilities  of  the  system. 

C.   FACTORS  AND  LEVELS 

Many  of  the  factors  which  determine  the  performance  of 
a  passive  acoustic  system  are  summarized  in  the  passive 
sonar  equation  [Urick,  1975J 

SL  -  TL   =   NL  -  DI  +  DT 

where  SL  is  the  source  level,  TL  is  the  transmission  loss, 

NL  is  the  noise  level,  DI  is  the  directivity  index  and 

DT  is  the  detection  threshold.   These  are  discussed  below. 

1 .   Source  Level  (SL) 

SL  is  defined  as  the  intensity  of  the  radiated  sound 

in  decibels  relative  to  the  intensity  of  a  plane  wave  of 

root-mean-square  pressure  of  one  micropascal  (uP  )  referred 

a 

to  a  point  one  yard  (or  meter)  from  the  acoustic  center  of 
the  projector.   The  SL  may  be  under  the  control  of  the 
evaluator  if  the  target  is  a  towed  projector  or  a  noise 
augmented  submarine.   In  the  case  of  a  non-augmented  sub- 
marine the  SL  depends  not  only  on  the  particular  boat,  but 
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also  on  the  signature  source  which  is  detected.   In  either 
case  it  is  important  that  an  accurate  determination  of  the 
SL  be  recorded.   This  should  include  an  investigation  of 
any  aspect  dependency.   Submarines  are  known  to  have  lobe 
patterns  in  the  horizontal  and  there  is  no  reason  to  expect 
that  towed  projectors  or  noise  augmentors  will  not  exhibit 
similar  characteristics.   A  record  should  be  kept  if  any 
changes  in  SL  are  made  during  the  evaluation  by  operating 
mode  changes  or  other  factors. 
2.   Transmission  Loss  (TL) 

The  impact  of  the  environment  on  the  performance 
of  the  system  is  indicated  by  transmission  loss.   The  propa- 
gation loss  term  is  one  of  the  aspects  thatmakes  the  test 
and  evaluation  of  sonar  systems  in  operational  environments 
difficult.   Rarely  can  the  losses  be  measured.   They  can 
be  indirectly  calculated  by  using  a  routine  such  as  ICAPS 
[Pennylegion,  1977]  in  connection  with  bathythermograph  informa- 
tion recorded  at  the  time  of  the  test. 

The  oceanographic  factors  which  determine  TL  cannot 
be  assumed  to  be  homogeneous  over  the  entire  area  of  a 
particular  trial.   Bathythermographs  from  the  vicintiy  of 
the  target  and  the  detecting  sonobouy  should  be  compared. 
If  the  traces  exhibit  significant  differences  a  more  sophis- 
ticated model  will  be  required  to  accurately  determine  the 
TL.   A  parabolic  equation  model  such  as  BTL  -  1  [NSIA,  1974] 
will  accommodate  more  than  one  sound  speed  profile  along  the 
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transmission  path.   Trials  conducted  in  the  areas  of  major 
currents,  frequent  eddies,  or  oceanographic  fronts  will 
experience  rapid  changes  in  transmission  paths  over  rela- 
tively short  horizontal  distances. 

Care  should  also  be  exercised  in  assuming  temperature 
profile  information  will  be  valid  over  extended  time  periods. 
Effects  such  as  daytime  heating,  rain  storms,  or  wave  action 
may  make  the  conditions  observed  at  the  time  of  a  bathy- 
thermograph short  lived.   Consultation  with  forecasters 
familiar  with  the  operating  areas  and  a  study  of  pertinent 
satellite  infrared  data  should  provide  a  feel  for  the  fre- 
quency and  scale  of  fluctuations  to  be  expected. 
3 .   Noise  Level  (NLj 

The  ambient  noise  at  the  hydrophone  together  with 

the  self  noise  of  the  system  when  expressed  as  the  average 

plane  wave  acoustic  intensity  in  a  specified  one  hertz 

frequency  band  relative  to  a  reference  intensity  of  one  yP^ 

a 

make  up  the  background  noise  against  which  the  system  must 
detect  a  signal.   For  modern  passive  acoustic  systems 
ambient  noise  is  the  dominant  factor.   Ambient  noise,  which 
is  principally  a  function  of  sea  state,  frequency,  propaga- 
tion conditions,  and  local  and  distant  shipping  is  a  fluc- 
tuating phenomenon.   Measurement  of  the  intensity  and 
variability  of  the  ambient  noise  should  be  made  as  close  to 
the  location  of  the  detecting  sonobouy  and  as  near  to  the 
time  of  detection  as  possible.   Any  directionality  in  the 
ambient  noise  levels  should  be  noted. 
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4.  Directivity  Index  (PI) 

This  is  a  measure  of  the  ability  of  an  array  to 
discriminate  against  noise  in  the  horizontal  or  vertical 
plane.   Omnidirectional  hydrophones  exhibit  no  directivity. 
Differences  in  DI  will  not  be  included  in  examples  which 
follow  although  for  systems  where  appropriate,  this  exten- 
sion would  be  straightforward. 

5.  Detection  Threshold  (DT) 

Detection  threshold  is  the  ratio  of  the  signal  power 
in  the  received  bandwidth  to  the  noise  power  in  a  1  Hz  band 
required  for  detection  at  some  preassigned  level  of  proba- 
tility  of  detection  at  a  fixed  rate  of  false  alarm.   DT  is 
expressed  in  decibel  units. 

The  detection  decision  involves  the  decision  by  the 
operator  as  to  whether  there  is  or  is  not  a  target  present. 
The  decision  is  correct  if  the  operator  declares  a  target  is 
present  when  there  is  a  target  or  when  the  operator  makes 
no  target  call  when  there  is  no  target  present.   The  deci- 
sion is  incorrect  either  when  the  operator  declares  a  target 
in  the  absence  of  one  or  fails  to  call  a  target  which  is 
present.   For  passive  acoustic  systems  DT  is  usually  quoted 
for  a  probability  of  detection  of  fifty  percent  and  a  false 
alarm  probability  of  one  in  ten  thousand.   For  modern  pro- 
cessors where  the  bandwidth  is  small  the  large  number  of 
frequency  cells  being  observed  at  any  time  results  in  false 
alarm  rates  of  the  order  of  one  per  hour.   The  DT  term  in 
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the  passive  equation  is  important  to  the  system  evaluator 
because  it  describes  the  detection  performance  of  the  man- 
machine  system  and  is  not  range  dependent. 

By  combining  the  DT,  AN,  DI,  and  SL  a  figure  of 
merit  (FOM)  for  the  encounter  is  calculated: 

FOM   =   SL  -  (AN  -  DI)  -  DT  . 

The  figure  of  merit  is  not  dependent  upon  TL  but 
can  be  used  in  conjunction  with  predicted  TL  to  determine 
the  expected  detection  performance  of  an  acoustic  system. 
6 .   Other  Factors 

The  factors  affecting  system  performance  which  do 
not  specifically  appear  in  the  passive  sonar  equation  include 
equipment  reliability,  target  signature  characteristics, 
and  the  tactics  employed  by  both  the  aircrew  and  the  sub- 
marine commander.   Determining  the  reliability  of  equipment 
under  evaluation  will  undoubtably  be  one  of  the  program 
goals.   The  impact  of  equipment  breakdowns  or  malfunctions 
on  data  collection  should  also  be  of  concern  to  those  per- 
forming the  trials.   If  a  balanced  experiment  is  planned, 
the  failure  to  obtain  results  for  a  given  set  of  conditions 
may  make  future  analyses  extremely  difficult.   The  variety 
of  submarine  acoustic  signature  characteristics  which  could 
affect  detection  cannot  be  discussed  in  this  unclasified 
thesis.   Test  planners  should  be  aware  of  them  and  ensure 
that  they  do  not  introduce  confusion  into  test  results. 
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The  tactics  employed  will  determine  such  factors  as 
the  number  of  detection  opportunities  in  a  trial  and  be 
critical  for  localization  once  detection  has  been  achieved. 
In  most  cases  the  tactics  employed  will  follow  the  guidelines 
indicated  by  established  doctrine.   However,  the  nature  of 
ASW  encounters  make  deviations  from  routine  tactics  in  a 
free-play  situation  likely.   In  order  to  control  encounters 
and  avoid  wasting  aircraft  and  submarine  resources,  opera- 
tional evaluations  usually  require  that  the  tactical  situa- 
tion be  controlled  by  the  test  director.   The  effects  of 
these  forced-encounter  scenarios  may  be  difficult  to  quan- 
tify.  However,  careful  planning  should  keep  them  to  a 
minimum.   For  example,  if  repeated  runs  are  planned  for  a 
certain  type  of  encounter,  care  should  be  taken  to  ensure 
that  the  players  do  not  use  knowledge  gained  in  earlier 
runs  to  give  them  an  artificial  advantage  in  the  later 
trials. 

D.   MEASURES  OF  EFFECTIVENESS  (MOE) 

The  measures  of  effectiveness  used  in  the  evaluation  of 
an  airborne  acoustic  processing  system  should  meet  the 
following  basic  requirements  adapted  from  [Rau,  1974] : 

1.  They  must  directly  relate  to  how  well  the  specific 
objectives  of  the  evaluation  are  met. 

2.  They  should  be  relevant  to  the  mission  or  operational 
roles  of  interest. 
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3.  They  should  be  precisely  defined  and  expressed  in 
terms  meaningful  to  the  decision  maker  in  order  to 
prevent  decision  makers  from  misunderstanding  their 
operational  implications. 

4.  They  must  be  stated  in  terms  of  inputs  that  can  be 
measured. 

The  specific  objectives  of  establishing  the  capability  of 
an  acoustic  system  in  the  areas  of  detection,  classifica- 
tion/ and  localization  are  discussed  below. 
1.   Detection 

A  suitable  measure  of  effectiveness  would  be  to 
report  the  probability  of  detecting  a  target  with  a  given 
SL  in  a  specified  ocean  area  under  prevailing  conditions 
of  sound  propagation  and  ambient  noise.   To  present  this 
the  passive  sonar  equation  dictates  the  determination  of  an 
accurate  and  achievable  detection  threshold  and  directivity 
index.   The  other  factors  involved  are  generally  either  func- 
tions of  the  environment  or  of  the  target  and  can  be  assumed 
to  be  either  known  or  measurable.   If  the  DI  and  DT  for 
the  various  modes  of  operation  of  the  processor  are  known 
over  the  range  of  frequencies  of  interest,  they  can  be 
combined  with  the  target's  SL  and  a  measure  of  AN  to  calcu- 
late the  FOM  for  an  encounter.   The  FOM  can  then  be  used  in 
conjunction  with  acoustic  forecasts  and  tactical  considera- 
tions to  compute  the  desired  probability  of  detection. 
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2 .  Classification 

It  is  possible  that  even  though  detections  are  made 
the  processing  system  does  not  provide  the  operator  with 
the  necessary  information  to  allow  a  high  proportion  of 
correct  classifications.   A  measure  of  the  effectiveness 
of  the  operator  -  processor  combination  in  classifying  tar- 
gets correctly  could  take  one  of  two  forms.   Perhaps  the 
simplest  approach  would  be  to  ask  the  operators  to  compare 
the  ease  of  classification  using  the  system  under  evaluation 
with  the  ease  of  using  previous  systems.   As  correct  classi- 
fication is  largely  a  function  of  operator  experience  and 
ability  this  procedure  could  provide  a  useful,  though  sub- 
jective, measure  of  effectiveness.   A  more  objective  measure 
would  involve  determining  the  probability  of  correct  classi- 
fication for  various  targets  at  given  signal  to  noise  ratios. 
This  would  probably  require  an  extensive  experiment  utilizing 
tape  recorded  target  signatures  and  a  selection  of  represen- 
tative operators. 

3 .  Localization 

Appropriate  measures  of  the  effectiveness  of  a 
system  in  the  localization  phase  would  be  expected  to  allow 
the  calculation  of  the  probability  of  localization  to  within 
specified  criteria.   The  probability  of  successful  localiza- 
tion would  mainly  be  a  function  of  detection  ranges  during 
the  encounter,  bearing  accuracies,  time  allowed  for  localiza- 
tion, aircrew  tactics,  and  submarine  characteristics  and 
tactics . 
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III.   EXPERIMENTAL  DESIGN 

A.   TEST  PLANNING  AND  ESTABLISHED  DESIGN  PRINCIPLES 

It  is  particularly  important  in  operational  test  and 
evaluation  to  prepare  a  comprehensive  test  plan.   The  com- 
plexity of  the  systems  under  investigation  and  the  number 
of  factors  which  are  likely  to  affect  performance  make  it 
imperative  that  adequate  time  and  effort  be  devoted  to  the 
preparation  of  a  test  plan.   A  good  test  plan  will  ensure 
that  the  maximum  amount  of  useful  information  can  be  gained 
from  test  results. 

A  balanced  test  design,  wherein  data  are  collected  for 
each  combination  of  the  experimental  factors  of  interest, 
is  desired  to  make  the  analysis  of  results  straightforward. 
There  are  numerous  well  documented  approaches  to  the  analy- 
sis of  balanced  experiments.   Determining  the  effects  on 
performance  of  the  different  levels  of  experimental  factors 
from  data  collected  in  unbalanced  situations  often  proves 
challenging  to  even  the  most  experienced  analysts.   If  the 
unbalance  is  extreme,  some  of  the  effects  may  not  be  esti- 
mable.  In  the  realm  of  operational  test  and  evaluation  it 
is  sometimes  impossible  to  achieve  complete  balance.   The 
test  planner  should  recognize  this  and  be  able  to  show  in 
advance  of  the  experimental  runs  that  his  proposed  design 
will  allow  the  required  analysis  even  when  some  of  the 
planned  test  runs  are  either  cancelled  or  invalidated. 
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Another  function  of  the  test  planning  phase  of  an 
experiment  is  to  designate  the  order  and  conditions  for  the 
test  runs  in  a  manner  which  avoids  the  confounding  of 
results.   Confounding  occurs  when  independent  variables 
are  allowed  to  vary  at  the  same  time  from  trial  to  trial. 
An  example  of  this  would  occur  if  in  an  evaluation  comparing 
two  types  of  sonobouys,  the  trials  for  one  type  of  bouy  were 
scheduled  for  morning  sorties  and  those  for  the  other  type 
scheduled  in  the  afternoon.   Any  difference  in  observed 
performance  due  to  the  type  of  bouy  would  be  inseparable 
from,  or  confounded  with,  the  effect  due  to  the  differences 
in  experimental  conditions  from  morning  to  afternoon.   A 
good  understanding  of  the  variables  which  are  likely  to 
affect  system  performance  and  the  practice  of  established 
experimental  design  principles  such  as  randomization  or 
blocking  will  enable  the  test  planner  to  avoid  the  confounding 
of  results. 

B.   DESIGNING  FOR  OPERATIONAL  REALISM 

In  the  realm  of  operational  test  and  evaluation  there 
is  an  underlying  requirement  that  trials  be  conducted  in 
an  operationally  realistic  environment  [Munroe,  1976] .  This  pre- 
sents the  test  planner  with  a  seeming  paradox.   On  the  one 
hand  he  is  asked  to  determine  the  performance  of  the  system 
under  evaluation  in  realistic  scenarios.   On  the  other  hand 
it  is  required  that  he  report  results  which  can  be  supported 
either  by  measures  of  statistical  significance  or  by 
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reproducability .   The  sheer  number  of  factors  which  could 
affect  system  performance  in  an  operational  scenario  makes 
the  design  of  a  "free  play"  experiment  to  establish  all  of 
their  individual  significances  untenable.   The  test  planner 
must  decide  upon  how  to  reduce  the  complexity  of  the  evalua- 
tion without  seriously  reducing  its  applicability  to  future 
operations . 

One  approach  to  the  solution  of  this  problem  is  to  use 
preliminary  laboratory  type  studies  to  estimate  the  effects 
of  the  experimental  factors.   The  accuracy  of  these  estimates 
can  then  be  verified  by  comparison  with  a  selection  of 
trials  made  in  a  variety  of  operational  environments.   If 
the  results  predicted  from  the  laboratory  studies  are  borne 
out  by  the  field  trials,  the  models  indicated  by  the  labora- 
tory work  can  be  used  with  confidence  over  the  realm  of 
operational  environments.   An  example  of  this  approach  will 
be  given  for  determining  the  detection  threshold  of  an 
acoustic  processing  system  at  a  later  point  in  this  thesis. 

Other  approaches  to  the  problem  of  balancing  operational 
realism  against  experimental  control  would  include  one  which 
is  somewhat  the  reverse  of  the  above.   Repeated  runs  of  small 
trials,  representing  portions  of  the  tactical  evolution  of 
interest  could  be  utilized  to  gain  accurate  estimates  of 
the  parameters  required  by  the  analyst  in  modelling  the 
complete  encounter.   The  analyst  could  then  use  these  parameters 
as  inputs  to  a  computer  simulation.   This  method  allows  the 
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investigation  of  system  performance  in  areas  such  as  target 
localization  where  the  multitude  of  tactical  decisions  which 
influence  results  would  make  a  thorough  at  sea  evaluation 
impossible.   The  study  of  target  localization  performance 
utilizing  sonobouys  and  an  acoustic  system  would  require 
that  estimates  of  such  parameters  as  the  bearing  accuracy 
in  the  difar  mode  be  established  by  at  sea  trials.   The 
use  of  a  computer  simulation  would  then  allow  the  study  of 
how  well  the  system  would  be  able  to  localize  a  target  for 
a  variety  of  tactical  approaches. 

C.   TEST  PLAN  REHEARSAL 

A  useful  procedure  for  ensuring  that  the  test  plan  and 
experimental  design  for  a  given  phase  of  an  operational 
evaluation  will  provide  the  desired  information  is  to  per- 
form a  rehearsal  on  paper.   The  rehearsal  need  not  be  elabor- 
ate but  should  include  all  of  the  steps  in  the  test  plan 
including  the  analysis  of  fictitious  data  and  drawing  con- 
clusions from  the  analysis.   The  data  should  be  developed 
from  a  combination  of  best  estimates,  contractor  specifica- 
tions and  previous  test  results.   By  generating  the  obser- 
vations in  an  intelligent  manner  the  test  planner  can 
determine  if  factor  effects  of  a  given  magnitude  will  be 
detected  by  the  test  plan.   The  test  planner  can  play  "what 
if"  to  check  out  the  impact  of  missing  observations  or 
other  eventualities  during  the  evaluation. 
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The  procedures  for  carrying  out  a  paper  rehearsal  are: 

1.  Estimate  the  expected  values  of  the  variables  to 
be  observed  during  the  trial  runs . 

2.  Estimate  the  variation  expected  for  the  variables 
estimated  above.   A  convenient  way  to  estimate 
variance  is  to  estimate  the  expected  range  for 
the  observations  and  divide  by  five  to  arrive  at 

an  approximate  standard  deviation  [Del  Priore,  1979J  , 

3.  Generate  the  fictitious  data  by  selecting  random 
numbers  from  the  appropriate  normal  distribution. 
The  normal  distribution  is  not  critical,  but  is 
probably  a  reasonable  choice  in  most  circumstances. 

4.  Add  or  subtract  any  factor  effects  for  each  data 
point.   The  factor  effects  should  be  representative 
of  those  which  the  test  planner  would  hope  to  detect 
in  the  evaluation. 

5.  Analyse  the  data  in  the  same  manner  as  the  actual 
data  will  be  analysed  after  the  evaluation. 

6.  Draw  conclusions  from  the  analysis  thereby  ensuring 
that  the  expected  conclusions  can  indeed  be  achieved 
from  the  data  and  analysis  as  planned. 

7.  Critique  the  results  of  the  rehearsal  by  comparing 
the  results  of  the  analysis  with  the  parameters  used 
to  generate  the  data. 

8.  Make  any  indicated  alterations  to  the  test  plan  and 
repeat  the  paper  rehearsal  if  the  magnitude  or  the 
number  of  changes  warrant  it. 
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An  example  of  a  paper  rehearsal  of  a  test  plan  for 
evaluating  the  detection  capability  of  an  acoustic  processing 
system  is  incorporated  in  Section  IV  of  this  thesis. 
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IV.   MODEL  FOR  DETERMINING  DETECTION  CAPABILITY 

A.  RATIONALE 

The  detection  performance  of  an  acoustic  processing 
system  is  effectively  determined  once  a  realistic  determina- 
tion of  the  detection  threshold  achieved  by  the  system  in  the 
operational  environment  has  been  determined.   The  DT  can  be 
utilized  to  calculate  the  FOM  for  a  given  encounter  and  the 
FOM  can  be  used  in  conjunction  with  TL  forecasts  by  routines 
such  as  LATRAN  from  the  ICAPS  package  to  construct  lateral 
range  curves  of  probability  of  detection  versus  range  (for 
example,  see  Fig.  1).   These  curves  provide  decision  makers 
with  an  operationally  meaningful  assessment  of  the  system's 
effectiveness.   The  model  developed  in  this  section  provides 
a  method  for  establishing  the  DT  of  an  acoustic  processing 
system.   A  method  is  also  developed  for  verifying  the  per- 
formance of  the  model  against  results  achieved  in  the  air- 
borne trials  of  an  operational  evaluation. 

B.  THE  DT  MODEL 

The  achieved  detection  threshold  (DT  )  is  assumed  to  be 

a 

effectively  modelled  by  a  linear  function  of  the  form: 


DT    =   DT   +■  B  +  M.  +  OP.  +  EQ.  +  (MOP)  .   +  (MWQ)  .,  +  e.  .. 
a       c         1      ]      k        13         lk     13k 
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where 

DT  =  the  calculated  detection  threshold 
c 

B  =  the  overall  bias 

M.  =  the  effect  due  to  mode  i 

OP .  =  the  effect  due  to  operator  j 


EQV       =   the  effect  due  to  equipment  of  serial 
number  k 


^k 


(MOP),.  _.   =   the  interaction  effect  due  to  mode  i 
and  operator  j 


ID 


(MEQ) ..   =   the  interaction  effect  due  to  mode  i 
and  equipment  k 

e-  .,      =   the  interaction  effect  due  to  mode  i 
and  equipment  k 


The  (OPEQ) .,  interaction  and  the  three  way  interaction  have 

been  assumed  to  be  negligable. 

The  DT   term  to  be  used  is  calculated  from  first  princi- 
c  ^ 

pies  as  described  by  Pryor  (1971).  However,  if  the  manu- 
facturer has  supplied  a  reasonable  estimate  of  DT  for  the 
frequency  and  mode  of  interest,  this  could  be  used  in  the 
place  of  DT^.  It  is  stressed  that  DT  will  vary  according 
to  the  integration  time,  bandwidth,  and  other  factors.  The 
nature  of  the  particular  processing  system  will  determine 
the  precise  way  in  which  DT   is  calculated.   The  standard 

figures  of  a  fifty  percent  probability  of  detection  and  a 

-4 

probability  of  false  alarm  of  10    should  be  used.   A 

different  value  for  the  probability  of  false  alarm  could 
be  used  without  altering  the  procedure  provided  it  is  used 
consistently  throughout. 
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C.   ESTIMATION  OF  FACTOR  EFFECTS 

The  effects  on  detection  performance  of  the  modes, 
operators,  and  equipments  can  be  estimated  from  the  results 
of  a  controlled  experiment.   Operators  of  acoustic  processors 
being  run  in  aircraft  on  ground  power  are  fed  known  signals 
in  background  noise.   The  signals  occurring  at  random  fre- 
quencies and  fixed  signal  to  noise  ratios  in  the  vicinity 
of  the  expected  DT  are  detected  by  the  operators  with  a 
certain  probability  of  detection  and  false  alarm.   The 
signal  levels  are  then  adjusted  by  the  methods  of  Pryor 

(1971)  to  a  DT   for  a  fifty  percent  probability  of  detection 
a 

and  a  previously  established  false  alarm  rate.   The  corres- 
ponding values  of  DT   are  then  subtracted  from  the  DT 

c  a 

figures  to  determine  the  ADT  observation  for  the  experimental 

conditions  of  the  test. 

The  preceding  equation  for  DT   can  be  rearranged  to  define 

a 

ADT  as  the  difference  between  DT   and  DT  . 

a       c 


ADT   =   DT   -  DT    =   B  +  M .  +  OP .  +  EQ,  +  (MOP) . . 
a      c  1      3      k         13 


+  (MEQ)  .,  +  e.  .. 
v  lk     13k 


A  balanced  factorial  design  combining  each  mode,  opera- 
tor, and  equipment  should  be  conducted.   The  experiment  should 
be  carried  out  according  to  an  appropriate  experimental  plan, 
such  as  a  Latin  square,  in  order  to  eliminate  any  possible 
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confounding  due  to  the  order  of  observation.   The  factor 
levels  used  will  depend  upon  the  particular  evaluation  at 
hand.   The  design  of  the  signal  processor  will  determine 
what  modes  of  operation  are  pertinent.   The  operators  par- 
ticipating in  the  experiment  would  ideally  be  all  of  those 
designated  to  operate  the  equipment  in  the  airborne  phases 
of  the  trials.   If  this  is  not  possible  the  operators  used 
should  be  selected  at  random  from  those  available.   Similar 
criteria  should  be  used  to  select  the  particular  processors 
(as  identified  by  serial  number)  to  be  used. 

Data  from  the  experiment  are  analysed  by  an  analysis  of 
variance  (ANOVA)  procedure.   This  will  identify  any  signi- 
ficant factor  or  interaction  effects.   For  the  effects  which 
prove  significant  the  parameters  to  be  used  in  the  model  are 
estimated.   The  most  common  method  of  estimating  the  effects 
is  the  method  of  least  squares.   The  method  will  not  be 
stated  here,  but  is  covered  in  detail  in  almost  all  statis- 
tics texts  (for  example,  see  DeGroot,  1975). 

D.   COMPARISON  OF  DETECTION  RESULTS  TO  THE  MODEL 

The  detection  thresholds  determined  by  the  model  can  be 
compared  with  results  achieved  during  evaluation  flights. 
For  each  detection  event  during  the  operational  evaluation, 
a  record  of  the  frequency,  mode,  operator,  wave  height, 
equipment  serial  number,  sono  depth,  local  bathythermograph, 
ambient  noise,  time  of  detection  and  bottom  type  and  depth 
is  made.   The  reconstruction  of  the  mission  will  determine 


33 


the  target's  range,  aspect,  depth  and  source  level  at  the 
time  of  detection.   The  transmission  loss  for  the  detection 
range  is  determined  from  an  ICAPS  run  for  the  appropriate 
conditions.   The  detection  threshold,  as  determined  from 
the  model  and  the  observed  ambient  noise,  source  level  and 
directivity  index  establish  the  figure  of  merit  for  the 
encounter.   The  signal  excess  (SE)  at  detection  is  calcu- 
lated by  subtracting  the  TL  at  the  detection  range  from 
the  FOM. 

These  computations  are  made  for  each  detection  event 
occuring  in  a  given  ocean  region  during  the  operational 
evaluation.   The  definition  of  DT  states  that  it  is  the 
value  of  the  signal  to  noise  ratio  required  for  a  fifty 
percent  probability  of  detection.   From  this  definition 
it  follows  that  the  median  SE  at  detection  should  be  zero. 
DeGroot  (1975)  gives  a  hypothesis  test  procedure  which  can 
be  used  to  determine  if  the  values  of  SE  observed  are  likely 
to  have  a  median  value  significantly  different  from  zero. 
An  example  of  a  paper  rehearsal  of  this  procedure  follows. 

When  the  hypothesis  test  indicates  a  SE  significantly 
different  from  zero,  there  are  two  possibilities.   If  the 
observed  median  is  negative,  the  most  likely  cause  is  that 
the  TL  model  has  provided  losses  which  are  overly  pessimis- 
tic.  For  example,  studies  by  Christenson,  Frank,  and  Geddes 
(1975)  indicate  that  low  frequency  bottom  transmission  paths 
are  less  dependent  upon  bottom  type  than  previously  believed 
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For  some  operating  areas  this  may  make  it  necessary  to  use 
fictitious  bottom  type  inputs  in  order  to  achieve  valid  TL 
estimates.   This  type  of  adjustment  should  be  necessary 
only  for  detection  ranges  where  bottom  bounce  contributions 
are  significant.   These  detections  would  usually  occur  at 
ranges  beyond  the  expected  direct  path  range. 

Values  of  SE  which  result  in  a  median  value  greater 
than  zero  indicate  that  the  modelled  DT  values  have  not  been 
achieved.   This  would  happen  when  higher  values  of  the 
signal  to  noise  ratio  than  expected  are  required.   A  de- 
tailed analysis  of  the  conditions  under  which  the  individual 
detections  took  place  should  be  made  with  the  goal  of  finding 
any  correlations  between  particular  factors  and  the  degraded 
detection  performances.   If  the  investigation  fails  to  dis- 
cover a  likely  explanation,  an  adjustment  to  the  model  is 
indicated.   The  adjustment  would  take  the  form  of  an  addi- 
tional parameter  equal  to  the  observed  median  SE  value . 
This  parameter  would  adjust  the  model  for  the  observed 
reduction  in  detection  capability  in  moving  from  the  ground 
test  environment  to  airborne  trials . 

E.   TEST  PLAN  REHEARSAL  FOR  DETERMINING  DETECTION  THRESHOLD 
1 .   Test  Design 

The  following  rehearsal  will  indicate  how  the  above 
procedure  would  be  carried  out  for  a  hypothetical  processing 
system.   The  equipment  has  five  modes  which  might  be  used 
in  a  detection  scenario  (see  Table  I) .   These  modes  are 
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expected  to  achieve  varying  DT  figures.   There  are  five 
operators  who  have  been  trained  in  the  use  of  the  system 
and  they  are  expected  to  be  assigned  to  the  evaluation  for 
its  duration.   The  evaluation  unit  has  been  assigned  three 
fleet  aircraft  for  use  during  the  evaluation.   The  test 
planner  has  assumed  that  the  three-way  interaction  effect 
of  the  modes,  operators  and  aircraft  is  not  likely  to  be 
significant  and  also  that  there  will  be  no  two-way  inter- 
action between  the  operator  and  the  equipment.   It  is  decided 
that  a  5  x  5  x  3  factorial  design  will  allow  the  estimat- 
tion  of  the  required  model  parameters.   The  order  of  the 
trials  is  laid  out  in  an  incomplete  block  design  as  indi- 
cated in  Table  I,  in  an  attempt  to  prevent  any  confounding 
of  a  possible  learning  effect  with  the  main  effects  of  the 
experiment. 

The  frequency  of  each  detection  is  recorded.   For 
some  systems  the  analyst  may  feel  that  there  is  an  effect 
due  to  frequency  which  has  not  been  adequately  modelled  by 
the  DT   calculation.   In  these  cases  it  would  be  appro- 
priate to  include  the  frequency  as  a  covariable  in  the 
analysis  of  variance.   For  this  example  it  is  assumed  that 
any  frequency  effect  has  been  effectively  removed. 
2.   Data  Generation 

The  test  planner  decides  from  preliminary  testing 
that  the  various  modes  of  operation  are  likely  to  achieve 
the  calculated  DT   plus  a  bias  with  mean  (y)  as  given  in 
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Table  II.   The  planner  assumes  that  the  variability  in 
observed  DT  can  be  represented  by  the  standard  deviations 
(a)  listed  in  this  table.   The  five  operators  involved  in 
the  trial  were  assumed  from  past  performances  to  have 
the  effects  listed  in  Table  III  on  DT.   The  three  particular  pro- 
cessors available  for  the  trials  are  a,  b,  and  c.   Equip- 
ment b  is  assumed  to  be  operating  degraded  by  .5  dB. 

The  observations  for  the  experiment  were  generated 
by  selecting  a  normally  distributed  random  number  from  a 
distribution  established  by  the  mode  parameters.   It  is 
emphasized  that  these  parameters  and  their  distributions 
are  simply  plausible  estimates  used  to  generate  the  data 
for  this  paper  rehearsal.   The  assumptions  regarding  their 
distribution  need  not  hold  for  the  actual  observations. 
Appropriate  biases  due  to  operator  and  equipment  were  then 
added  to  this  number  to  give  the  observed  ADT.   No  depen- 
dency on  frequency  or  order  of  observation  was  included. 
For  example,  the  first  observation  in  Table  I  was  obtained 
in  the  following  manner.   The  random  observation  equal  to 
0.3  was  chosen  from  a  normal  distribution  centered  at  0.0 
with  a  standard  deviation  of  0.5.   A  bias  of  0.5  dB  corres- 
ponding to  operator  A  was  added  to  the  random  number  to 
arrive  at  an  observed  ADTOf0.8  dB.   The  frequencies  for 
the  various  observations  were  selected  at  random  from  a 
uniform  distribution  over  the  interval  from  ten  to  two 
thousand  Hz,  which  are  the  limits  of  the  system.   This 
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MODES  DT 


Computer  Assisted  Detection  1  (CA1) 
Computer  Assisted  Detection  2  (CA2) 
Operator  Standard  Search  (OSS) 
Operator  Vernier  (OV) 
Operator  Super  Vernier  (OSV) 


Table  II.   Assumed  Means  and  Standard  Deviations  for 
Various  Modes  of  System  Operation 


u 

a 

0.0    dB 

0.5   dB 

0.5    dB 

1.0    dB 

2.0    dB 

2.0    dB 

1.0    dB 

1.0    dB 

0.5    dB 

1.0    dB 

OPERATOR  OPERATOR  BIAS 

A  0 . 5  dB 

B  1.0  dB 

C  0.5  dB 

D  0.0  dB 

E  0.5  dB 

Table  III.   Assumed  Operator  Bias  for  each  of  Five  Operators 
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procedure  was  used  to  generate  the  7  5  observations  of  ADT 

in  Table  I. 

3 .   Analysis  of  the  Data 

The  data  were  then  analyzed  by  a  general  purpose 

analysis  of  variance  (ANOVA)  package  being  developed  in 

APL  by  Prof.  F.  R.  Richards  at  the  Naval  Postgraduate 

School.   The  following  ANOVA  table  summarizes  the  results 

of  this  analysis. 

ANOVA 


SOURCE  OF 

SS 

Dl 

VARIATION 

MAIN  EFFECTS 

MODE 

47.68 

4 

OPERATORS 

21.65 

4 

EQUIPMENT 

3.88 

2 

TWO-WAY 

INTERACTIONS 

MODE-OPERATORS 

9.34 

16 

MODE-EQUIPMENT 

7.5 

8 

RESIDUAL 

50.37 

40 

MS 


11.92 
5.41 
1.94 


0.58  0.46 
0.94  0.75 
1.26 


SIGNIFICANCE 
LEVEL 


9.46     .00002 
4.29     .006 
1.5  4    not  sign. 


not  sign. 
not  sign. 


TOTAL 


140.43     74 


1.9 


Table  IV.   Analysis  of  Variance  Results 
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The  ANOVA  results  indicate  that  there  are  significant 
effects  due  to  the  different  modes  of  operation,  and  due  to 
the  different  operators.   The  Beta  vector,  as  calculated 
by  the  ANOVA  routine,  can  be  used  to  estimate  the  parameters 
of  the  model.   The  model  is  now  simplified  to  include  only 
those  effects  found  to  be  significant  by  the  ANOVA: 


ADT   =   B  +  M.  +  OP  .  +  e.  ., 
1     j  ink 


The  parameter  estimates,  as  calculated  for  this  example, 
are  listed  in  Table  V. 

B  =  0.3 

MODE  EFFECTS  OPERATOR  EFFECTS 

CA1   =  -0.2  dB  A   =   1.1  dB 

CA2   =0.1  dB  B   =   1.5  dB 

OSS   =   2.0  dB  C   =   1.1  dB 

OV    =   0.8  dB  D   =   0.5  dB 

OSV   =   0.0  dB  E   =   0.0  dB 


Table  V.   Parameter  Estimates  for  the  Intercept 
and  Mode  and  Operator  Effects  of  the 
DT  Model 


The  assumptions  made  in  the  ANOVA  calculations  make 
these  estimates  of  factor  effects  valid  only  when  used  in 
conjunction  with  the  other  effects  and  are  not  individually 
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meaningful.   In  order  to  compare  the  estimated  effects  to 
the  inputs  used  to  generate  the  ADT  observations  an  adjust- 
ment must  be  made  which  adjusts  to  zero  those  effects 
which  were  input  with  a  mean  of  zero.   The  estimate  for 
the  CA1  effect  is  added  to  all  mode  estimates  which  makes 
the  CA1  effect  zero  and  thus  the  same  as  the  mean  of  the 
input  distribution.   The  other  adjusted  effects  can  then 
be  compared  to  their  mean  input  values.   A  similar  proce- 
dure' is  carried  out  with  the  operator  effects  where  the 
D  value  is  subtracted  from  each  estimate. 


MODES 

INPUT 

ADJ.  MODEL 

CA  1 

0.0 

0.0 

CA  2 

0.5 

0.4 

OSS 

2.0 

2.2 

ov 

1.0 

1.0 

osv 

0.5 

0.2 

OPERATORS 

INPUT 

ADJ.  MODEL 

A   0.5 

0.6 

B   1.0 

1.1 

C   0.5 

0.6 

D   0.0 

0.0 

E   0.5 

-0.5 

Table  VI.   Comparison  of  Input  Effects  with 
Adjusted  Model  Results 


It  is  apparent  that  these  adjusted  estimates  agree 
very  closely  with  the  inputs,  the  one  exception  being  the 
estimate  for  operator  E  where  a  deviation  of  1  dB  occurs. 

The  fact  that  the  individual  mode  or  operator  effects 
are  not  unique  except  to  within  a  linear  transformation  does 
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not  affect  the  performance  of  the  model  in  estimating 
expected  ADT  values.   For  example,  if  an  estimate  of  DT 
is  required  for  operator  C  using  any  of  the  equipments 
in  mode  OSS,  the  following  calculations  are  made. 


ADT   =   B  +  M     +  OP 
OSS       c 


=   0.3+2.0+1.1   =   3.4  dB 


If  DT   for  mode  OSS  at  the  appropriate  frequency  is  -6  dB, 


then, 


DT   =   -6  dB  +  3.4  dB   =   -2.6  dB 

is  the  DT  value  established  by  the  model  for  this  combination 
of  factors. 

4 .   Comparison  of  Field  Results  with  the  Model 

The  detection  events  occurring  during  the  portion 
of  the  evaluation  in  a  particular  operating  area  were 
recorded  as  indicated  by  the  data  collection  sheet,  Fig.  2. 
An  example  of  the  data  reduction  required  to  determine  the 
SE  at  detection  follows.   The  data  recorded  on  the  collection 
sheet  in  Fig.  2  are  used  for  this  example. 

Reconstruction  of  the  mission  yields  a  range  at 
detection  of  2  50  0  yards.   The  bow  of  the  submarine  was 
towards  the  detecting  bouy  indicating  a  SL  of  160  dB.   From 
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the  block  data  portion  of  an  ICAPS  propagation  loss  routine, 
the  TL  at  the  detected  frequency  is  found  to  be  8  0.9  dB  at 
a  range  of  2500  yards. 

The  DT  as  determined  by  the  model  for  the  mode, 
equipment  and  operator  involved  and  a  DT   of  -5  dB  is: 

DT   =   DT   +  B  +  M     +  OP,_ 
c        oss     b 

=   -5  +  0.3  +  2.0  +  1.5 
=   -1.2 

From  this  information  the  FOM  is  determined. 

FOM   =   SL  -  (AN  -  DT)  -  DT 
=   160  -  (75  -  0)  +  1.2 
=   8  6.2  dB 

The  SE  for  the  detection  is 

SE   =   FOM  -  TL 

=   86.2  -  80.9 
=   5.3  dB 

The  same  procedure  is  repeated  for  each  detection  event  in 
this  ocean  area.   The  resulting  SE  values  are  as  listed  in 


45 


Table  VII.   These  data  were  generated  by  selecting  random 
variates  from  a  normal  (0,3)  population.   Again  the  values 
of  the  population  parameters  and  the  nature  of  the  distribu- 
tion are  only  logical  estimates  and  not  critical  to  the 
analysis  of  actual  observations. 

DET  #  DET  #  DET  # 

1  1.7  9    3.6  17    1.4 

2  -1.5  10    0.1  18   -1.5 

3  1.5  11   -1.1  19    3.7 

4  4.8  12    3.1  20    4.4 

5  -1.0  13    4.0  21   -0.2 

6  -1.4  14   -1.6  22    1.7 

7  1.6  15    3.5  23   -1.2 

8  3.5  16    2.6  24    5.3 

Table  VII.   Observed  Signal  Excess  Values  in  dB 

A  test  of  the  null  hypothesis  that  the  median  of  the 

observed  SE  value  is  equal  to  zero  versus  the  alternate 

hypothesis  that  the  median  is  not  equal  to  zero  is  carried 

out  by  the  method  of  DeGroot  (197  5) .   For  a  probability  of 

type  one  error  (a)  of  0.1,  zero  falls  in  the  interval  between 

the  eighth  and  sixteenth  order  statistics  of  the  observed 

set  of  SE  values.   For  our  data  Y0  =  -1.0  and  Yn _  =  3.1. 

o  lb 

Therefore,  we  cannot  reject  the  null  hypothesis.   We  can 
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assume  that  the  observed  SE  values  do  not  indicate  that 
the  DT  levels  achieved  in  this  ocean  area  are  different 
from  those  derived  from  the  model. 
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V.   CONCLUSIONS 

•  The  most  appropriate  method  for  reporting  the  detection 
capability  of  an  airborne  acoustic  processing  system  is 
through  an  accurate  determination  of  the  system's  detec- 
tion threshold.   The  model  developed  from  observations 
of  detection  performance  under  controlled  conditions 
allows  the  evaluator  to  report  expected  detection  per- 
formance for  operational  scenarios  for  which  predictions 
of  target  source  level  and  transmission  loss  are  available 

•  Successful  accomplishment  of  the  operational  evaluations 
of  acoustic  processors  require  that  those  involved  have 
a  thorough  knowledge  of  the  factors  likely  to  affect 
system  performance.   Without  a  detailed  understanding 

of  the  environment,  operators,  and  other  variables  which 
affect  the  results  of  acoustic  trials,  the  test  direc- 
tor is  unlikely  to  arrive  at  satisfactory  measures  of 
system  effectiveness. 

•  The  operational  test  and  evaluation  of  sophisticated 
acoustic  processors  requires  that  established  experimen- 
tal design  concepts  be  utilized  in  the  preparation  of 
test  plans.   The  high  cost  of  at-sea  trials  do  not  allow 
the  evaluator  to  risk  the  failure  to  achieve  required 
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results  from  the  encounters  available.   A  comprehen- 
sive test  plan  which  includes  the  analysis  methods  to 
be  used  is  the  only  method  of  ensuring  meaningful 
results. 

•  A  properly  exercised  paper  rehearsal  of  the  test  plan 
for  an  operational  evaluation  will  provide  a  good 
indication  of  the  feasibility  of  achieving  the  goals 
of  the  proposed  experiment. 
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